70 research outputs found
Functional cartography of complex metabolic networks
High-throughput techniques are leading to an explosive growth in the size of
biological databases and creating the opportunity to revolutionize our
understanding of life and disease. Interpretation of these data remains,
however, a major scientific challenge. Here, we propose a methodology that
enables us to extract and display information contained in complex networks.
Specifically, we demonstrate that one can (i) find functional modules in
complex networks, and (ii) classify nodes into universal roles according to
their pattern of intra- and inter-module connections. The method thus yields a
``cartographic representation'' of complex networks. Metabolic networks are
among the most challenging biological networks and, arguably, the ones with
more potential for immediate applicability. We use our method to analyze the
metabolic networks of twelve organisms from three different super-kingdoms. We
find that, typically, 80% of the nodes are only connected to other nodes within
their respective modules, and that nodes with different roles are affected by
different evolutionary constraints and pressures. Remarkably, we find that
low-degree metabolites that connect different modules are more conserved than
hubs whose links are mostly within a single module.Comment: 17 pages, 4 figures. Go to http://amaral.northwestern.edu for the PDF
file of the reprin
Revisiting Date and Party Hubs: Novel Approaches to Role Assignment in Protein Interaction Networks
The idea of 'date' and 'party' hubs has been influential in the study of
protein-protein interaction networks. Date hubs display low co-expression with
their partners, whilst party hubs have high co-expression. It was proposed that
party hubs are local coordinators whereas date hubs are global connectors. Here
we show that the reported importance of date hubs to network connectivity can
in fact be attributed to a tiny subset of them. Crucially, these few, extremely
central, hubs do not display particularly low expression correlation,
undermining the idea of a link between this quantity and hub function. The
date/party distinction was originally motivated by an approximately bimodal
distribution of hub co-expression; we show that this feature is not always
robust to methodological changes. Additionally, topological properties of hubs
do not in general correlate with co-expression. Thus, we suggest that a
date/party dichotomy is not meaningful and it might be more useful to conceive
of roles for protein-protein interactions rather than individual proteins. We
find significant correlations between interaction centrality and the functional
similarity of the interacting proteins.Comment: 27 pages, 5 main figures, 4 supplementary figure
Mesoscopic organization reveals the constraints governing C. elegans nervous system
One of the biggest challenges in biology is to understand how activity at the
cellular level of neurons, as a result of their mutual interactions, leads to
the observed behavior of an organism responding to a variety of environmental
stimuli. Investigating the intermediate or mesoscopic level of organization in
the nervous system is a vital step towards understanding how the integration of
micro-level dynamics results in macro-level functioning. In this paper, we have
considered the somatic nervous system of the nematode Caenorhabditis elegans,
for which the entire neuronal connectivity diagram is known. We focus on the
organization of the system into modules, i.e., neuronal groups having
relatively higher connection density compared to that of the overall network.
We show that this mesoscopic feature cannot be explained exclusively in terms
of considerations, such as optimizing for resource constraints (viz., total
wiring cost) and communication efficiency (i.e., network path length).
Comparison with other complex networks designed for efficient transport (of
signals or resources) implies that neuronal networks form a distinct class.
This suggests that the principal function of the network, viz., processing of
sensory information resulting in appropriate motor response, may be playing a
vital role in determining the connection topology. Using modular spectral
analysis, we make explicit the intimate relation between function and structure
in the nervous system. This is further brought out by identifying functionally
critical neurons purely on the basis of patterns of intra- and inter-modular
connections. Our study reveals how the design of the nervous system reflects
several constraints, including its key functional role as a processor of
information.Comment: Published version, Minor modifications, 16 pages, 9 figure
Jerarca: Efficient Analysis of Complex Networks Using Hierarchical Clustering
Background: How to extract useful information from complex biological networks is a major goal in many fields, especially in genomics and proteomics. We have shown in several works that iterative hierarchical clustering, as implemented in the UVCluster program, is a powerful tool to analyze many of those networks. However, the amount of computation time required to perform UVCluster analyses imposed significant limitations to its use. Methodology/Principal Findings: We describe the suite Jerarca, designed to efficiently convert networks of interacting units into dendrograms by means of iterative hierarchical clustering. Jerarca is divided into three main sections. First, weighted distances among units are computed using up to three different approaches: a more efficient version of UVCluster and two new, related algorithms called RCluster and SCluster. Second, Jerarca builds dendrograms based on those distances, using well-known phylogenetic algorithms, such as UPGMA or Neighbor-Joining. Finally, Jerarca provides optimal partitions of the trees using statistical criteria based on the distribution of intra- and intercluster connections. Outputs compatible with the phylogenetic software MEGA and the Cytoscape package are generated, allowing the results to be easily visualized. Conclusions/Significance: The four main advantages of Jerarca in respect to UVCluster are: 1) Improved speed of a novel UVCluster algorithm; 2) Additional, alternative strategies to perform iterative hierarchical clustering; 3) Automatic evaluatio
Systematic Analysis of Pleiotropy in C. elegans Early Embryogenesis
Pleiotropy refers to the phenomenon in which a single gene controls several distinct, and seemingly unrelated, phenotypic effects. We use C. elegans early embryogenesis as a model to conduct systematic studies of pleiotropy. We analyze high-throughput RNA interference (RNAi) data from C. elegans and identify “phenotypic signatures”, which are sets of cellular defects indicative of certain biological functions. By matching phenotypic profiles to our identified signatures, we assign genes with complex phenotypic profiles to multiple functional classes. Overall, we observe that pleiotropy occurs extensively among genes involved in early embryogenesis, and a small proportion of these genes are highly pleiotropic. We hypothesize that genes involved in early embryogenesis are organized into partially overlapping functional modules, and that pleiotropic genes represent “connectors” between these modules. In support of this hypothesis, we find that highly pleiotropic genes tend to reside in central positions in protein-protein interaction networks, suggesting that pleiotropic genes act as connecting points between different protein complexes or pathways
Systematic Analysis of Pleiotropy in C. elegans Early Embryogenesis
Pleiotropy refers to the phenomenon in which a single gene controls several distinct, and seemingly unrelated, phenotypic effects. We use C. elegans early embryogenesis as a model to conduct systematic studies of pleiotropy. We analyze high-throughput RNA interference (RNAi) data from C. elegans and identify “phenotypic signatures”, which are sets of cellular defects indicative of certain biological functions. By matching phenotypic profiles to our identified signatures, we assign genes with complex phenotypic profiles to multiple functional classes. Overall, we observe that pleiotropy occurs extensively among genes involved in early embryogenesis, and a small proportion of these genes are highly pleiotropic. We hypothesize that genes involved in early embryogenesis are organized into partially overlapping functional modules, and that pleiotropic genes represent “connectors” between these modules. In support of this hypothesis, we find that highly pleiotropic genes tend to reside in central positions in protein-protein interaction networks, suggesting that pleiotropic genes act as connecting points between different protein complexes or pathways
Expression QTL Modules as Functional Components Underlying Higher-Order Phenotypes
Systems genetics studies often involve the mapping of numerous regulatory relations between genetic loci and expression traits. These regulatory relations form a bipartite network consisting of genetic loci and expression phenotypes. Modular network organizations may arise from the pleiotropic and polygenic regulation of gene expression. Here we analyzed the expression QTL (eQTL) networks derived from expression genetic data of yeast and mouse liver and found 65 and 98 modules respectively. Computer simulation result showed that such modules rarely occurred in randomized networks with the same number of nodes and edges and same degree distribution. We also found significant within-module functional coherence. The analysis of genetic overlaps and the evidences from biomedical literature have linked some eQTL modules to physiological phenotypes. Functional coherence within the eQTL modules and genetic overlaps between the modules and physiological phenotypes suggests that eQTL modules may act as functional units underlying the higher-order phenotypes
Semantic integration to identify overlapping functional modules in protein interaction networks
<p>Abstract</p> <p>Background</p> <p>The systematic analysis of protein-protein interactions can enable a better understanding of cellular organization, processes and functions. Functional modules can be identified from the protein interaction networks derived from experimental data sets. However, these analyses are challenging because of the presence of unreliable interactions and the complex connectivity of the network. The integration of protein-protein interactions with the data from other sources can be leveraged for improving the effectiveness of functional module detection algorithms.</p> <p>Results</p> <p>We have developed novel metrics, called semantic similarity and semantic interactivity, which use Gene Ontology (GO) annotations to measure the reliability of protein-protein interactions. The protein interaction networks can be converted into a weighted graph representation by assigning the reliability values to each interaction as a weight. We presented a flow-based modularization algorithm to efficiently identify overlapping modules in the weighted interaction networks. The experimental results show that the semantic similarity and semantic interactivity of interacting pairs were positively correlated with functional co-occurrence. The effectiveness of the algorithm for identifying modules was evaluated using functional categories from the MIPS database. We demonstrated that our algorithm had higher accuracy compared to other competing approaches.</p> <p>Conclusion</p> <p>The integration of protein interaction networks with GO annotation data and the capability of detecting overlapping modules substantially improve the accuracy of module identification.</p
- …